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ABSTRACT 


Temporal information is pervasive and crucial in medical records and other clinical text, as it formulates 
the development process of medical conditions and is vital for clinical decision making. However, providing 
a holistic knowledge representation and reasoning framework for various time expressions in the clinical text 
is challenging. In order to capture complex temporal semantics in clinical text, we propose a novel Clinical 
Time Ontology (CTO) as an extension from OWL framework. More specifically, we identified eight time- 
related problems in clinical text and created 11 core temporal classes to conceptualize the fuzzy time, cyclic 
time, irregular time, negations and other complex aspects of clinical time. Then, we extended Allen’s and 
TEO’s temporal relations and defined the relation concept description between complex and simple time. 
Simultaneously, we provided a formulaic and graphical presentation of complex time and complex time 
relationships. We carried out empirical study on the expressiveness and usability of CTO using real-world 
healthcare datasets. Finally, experiment results demonstrate that CTO could faithfully represent and reason 
over 93% of the temporal expressions, and it can cover a wider range of time-related classes in clinical 
domain. 


* These authors contributed equally to this work. 
t Corresponding author: Fangfang Xu (E-mail: xuff@wust.edu.cn; ORCID: 0000-0003-1057-6444). 
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1. INTRODUCTION 


Time-related information is essential part of in-patient medical records, doctor prescriptions and other 
clinical text. It describes the sequence of symptoms and treatments received by patients and is crucial in 
clinical decision-making and medical research [1, 2]. However, natural language expressions about time 
and temporal relation concepts are diverse and can vary with different contexts and writers. Current research 
on temporal knowledge representation in text mainly focused on news or open domain [3, 4, 5]. However, 
due to the distinct features in the medical domain, the research results from the news domain or open 
domain cannot be applied directly to the medical domain [6]. 


Temporal knowledge representation is challenging and has been discussed extensively. Generally, 
OWL-Time [7] can be used to represent instant and interval, and possibly Allen’s 13 temporal relations [8]. 
Reasoning engine CHRONOS [9] can be used for relation inference, inconsistency detection and path 
consistency check. However, these ontologies do not cover some important features, such as irregular, 
granular time, and modality. These ontologies designed for general domains are not adequate enough for 
specific domains such as healthcare [10]. To fill the gap between general tools and the clinical domain, 
Tao et al. [11] proposed temporal ontology CNTRO. After that, Li et al. [12] extended it and proposed the 
TEO ontology. They leveraged a Java-based TEO reasoner to realize complex timeline reasoning. We can 
use it to represent granular time, instant, interval and incomplete time. However, there is also some uncertain 
and irregular time in the clinical domain, such as "take pills around 6 a.m.,” and "The patient received 
rehabilitation training twice a day last year, and once every 2 days this year.” The former describes an 
uncertain time. There is no precise time point for the occurrence of an event, but there is still a rough range. 
For example, ”4 p.m.” is not in the range. The latter one describes an irregular time, and it represents the 
phased change of the event. 


Another challenge is the representation of inference queries with complex time. For example, if we have 
known that Patient A had surgery yesterday and was advised by his doctor to do 20 minutes of rehab 
around 9:00 am every morning,” then for the question, “What should Patient A do at 9:05 am this morning?” 
Humans can quickly answer it. Nevertheless, all of those ontologies mentioned above cannot get the 
answer. Based on this, we proposed Clinical Time Ontology (CTO), which aims to represent the complex 
temporal information in clinical texts. 


The contribution of this paper can be summarized as follows: 


1) We defined 11 core time classes, and proposed a robust ontology modeling complex temporal 
concepts and relations; 

2) We introduced the concept of complex time related classes and the concept of complex temporal 
relations using the temporal logic language and depicted them in the form of diagrams and formulas; 

3) We invited three annotators to annotate the real data, and evaluated the result by IAA metrics and 
the coverage ability of the ontology. And The results show that the CTO ontology can cover a wider 
range of time-related classes and temporal relationships in the medical record data, and it can 
faithfully represent 93% time-related classes and temporal relationships. 
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The remainder of this paper is organized as follows: Section 2 discusses the related work; Section 3 
summarizes eight types of time-related problems in clinical domain and also introduces 11 core time 
classes; Section 4 elaborates CTO ontology’s design principles; Section 5 presents the definition of complex 
time classes and complex time relations in terms of semantics; Section 6 depicts data sets, experiments and 
comparisons with other ontologies about representation; Section 7 summarizes of this paper. 


2. RELATED WORK 


The representation of time information has been discussed extensively in Semantic Web. In this paper, 
we discuss general purpose of temporal ontologies and medical domain temporal ontologies separately. 


2.1 General Domain Temporal Ontologies 


Hobbs et al. [13] proposed the DAML ontology of time that integrates first-order predicate calculus for 
topological temporal relations such as intervals, events, dates and times. Based on which, OWL-Time 
ontology [7] provides vocabularies for instant and interval, duration, datetime, and Allen’s time relations 
[8] in terms of time ontologies. However, OWL-Time focuses on definitive, accurate timestamp-based time 
representation and cannot cater to complicated time expressions (e.g., periodic time, fuzzy time, and multi- 
granularity time representation). 


SWRL temporal ontology [14] defined OWL entities to represent interval-based time information in OWL, 
and provided OWL entities to represent all entities defined by the model. It contains propositions, effective 
time (instants and interval), granularity and duration, and also contains a set of SWRL built-in functions that 
can be used to reason about time information. Eleftherios et al. [9] proposed CHRONOS, a system for 
reasoning about time information in OWL ontologies. CHRONOS provides both qualitative and quantitative 
information representation, enabling inferring implied relations, detecting inconsistencies, and ensuring 
path consistency. Vadim et al. [15] proposed PSI-Time Ontology, which can represent concepts of some 
complex time such as relativist and absolutist duration, periodic time intervals, interval phases, open time, 
linear time, and the relations, point-to-point, point-to-interval and interval-to-interval. However, they do 
not cover all time-related aspects in clinical domain, and are difficult to link to structured or unstructured 
clinical data. 


2.2 Clinical Domain Temporal Ontologies 


Tao et al. [11] proposed the Clinical Narrative Temporal Relations Ontology (CNTRO), which aims to fill 
the semantic gap in temporal modeling in the clinical domain. It provides vocabularies such as Event, Time, 
Duration, Granularity, Precision, and TemporalRelationStatement. Li et al. [12] extended CNTRO to develop 
a novel Temporal Event Ontology (TEO), which can formally represent and reason about structured and 
unstructured time information, supporting flexible temporal relationship representation and reasoning. 
However, TEO is still insufficient in terms of representing and reasoning on fuzzy time, irregular time, and 
complex relationships. 
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Representation of complex time (e.g., fuzzy time, irregular collection time, etc.), subjectivity, negation, 
and complex temporal relationships is significant challenge for temporal ontologies. Based on this, we 
propose Clinical Time Ontology (CTO), which aims to achieve inference and query complex temporal 
information problems and mine more relationships between them. 


3. TEMPORAL EXPRESSIONS IN CLINICAL TEXT 


Temporal information is pervasive in clinical data. However, it is nontrivial to capture temporal information 
in unstructured text from clinical data, which is pivotal to healthcare decision making. For example, a 
patient’s self-description and previous medical history can be obtained from the medical record data. It 
could take the form of a paragraph in natural language, and healthcare providers need to digest this 
paragraph to understand the chronological events it implies, so as to make a diagnosis or treatment plan. 
Formulating a semantic framework for time-related expressions in clinical text is a first step towards 
automated reasoning and decision support. In this section, we first categorize the time-related expressions 
in typical clinical text, and then we provide a set of key time concepts. 


3.1 Complicated Expressions of Time in Clinical Text 


We refer to TIDES [16] for a basic classification of temporal expressions, then we further refine the 
categorization of temporal expressions in the clinical domain based on our observation over 3.5K 
anonymized real-world clinical records. We identified different types of time descriptions from the clinical 
text according to different temporal aspects, including completeness, accuracy, repetition, subjectivity, etc. 
Regarding to the completeness, there can be complete or incomplete time; to the accuracy, there can be 
accurate or fuzzy time; to the repetition, there can be cyclic or irregular time. And based on this, we divided 
time into 5 types (e.g., absolute time, lack information time, fuzzy time, cyclic time, and irregular time). 
And then, we also obtained three qualitative questions, such as questions of subjectivity (e.g., whether it 
is worth believing), questions of temporal relationships (e.g., before, after, etc.), and questions of negation 
temporal relationships (e.g., not before, not after, etc.). In general, we summarized eight problems, which 
are shown in Table 1. 


“Absolute time” is a point or duration that can be represented separately on the time axis; “Lack 
information time” means the sentence which describes the time information about the event is incomplete, 
and it includes two subcategories, which are "relative time” and “incomplete time.” The former means that 
we need to find the reference time and get the real time by calculating. The latter is commonly used to 
describe a time interval but without complete information (e.g., lack start time or end time); “Fuzzy time” 
indicates that there exist approximate and vague modifiers in sentence, and it may be an instant or an 
interval; “Cyclic time” means that the event is repeated and the occurrence is regular, and it may be a 
collection of multiple instants or intervals; “Irregular time” means that the event is repeated, but not regular, 
and its items may be instants or intervals or collections. 
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Table 1. Classification and examples of time-related problems in clinical texts. 


Id Type Example Answer 
1 Absolute Time Problems When was Patient-A hospitalized? July 1, 2021 
2 Lack Information Time Problems (Patient-A is known that have been hospitalized on June 28, 2021 
(Relative Time and Incomplete July 1, 2021, and developed symptoms 3 days 
Time) prior to hospitalization.) When did Patient-A 
present with symptoms? 
3 Fuzzy Time Problems (Patient-A is known to have surgery on July 1, July 4, 2021 


2021, and people usually recover as early as 3 
days and as late as 5 days after surgery) When is 
the earliest Patient-A can be discharged? 


4 Cyclic Time Problems When is the following review of Patient- A? July 3, 2021 

5 Irregular Time Problems When did Patient-A‘s first course of treatment end? July 1, 2021 

6 Subjective Problems Is this time subjectively perceived by the patient or Patient subjectively 
objectively recorded? perceived 

7 Temporal Relation Problems (Patient-A is known to be hospitalized on July 1, Patient-A 


2021, and Patient-B is known to be hospitalized on 
July 12, 2021) Who was hospitalized first, Patient A 
or B? 
8 Negation Problems Whose operation event is not before Patient-A’s? Patient-B, Patient-C 


3.2 Time Types 


The temporal expressions listed in Table 1 can be addressed using a combination of syntactic forms and/ 
or semantic concepts. Syntactically, all temporal expressions can use 1, 2 or more timestamps to denote 
the relevant instance, interval or repetitions, additionally, temporal concepts and relations can have modifiers 
to express beliefs or polarity. Semantically, a temporal concept can be absolute, fuzzy, relative, or periodic, 
etc. Based on the first five problems in Table 1, we categorized the time. Problems 6 and 8 motivate the 
subjective and negation modifiers. We analyze the combinations of the above syntactical and semantic axis 
of temporal concepts and relations and organize them in Table 2 with correlations to the temporal expression 
types listed in Table 1. 


“Absolute Instant” is a point in time, with no fuzzy (e.g., about, server) and relative modifiers (e.g., ago, 
after); “Relative Instant” is a point in time, but with relative modifiers; “Fuzzy Instant” is a point in time, 
but with fuzzy modifiers; 


“Absolute Interval” is a duration in time, whose start and end time are “Absolute Instant” instances; 
“Relative Interval” is a duration in time in which at least one of the start or end time is a “Relative Instant” 
instance; "Fuzzy Interval” is a duration in time, and the condition is satisfied as long as one of its three 
components (duration quantity, start time and end time) is with a fuzzy modifier, even though its start time 
may make it like a “Relative Interval” instance; “Incomplete Interval” is a duration in time with only one 
of the three components, whether or not the component is with a modifier; 
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Table 2. Time types obtained by combinations of the syntactical and semantic axis of temporal concepts and 
relations. 


Syntactic Forms Semantic Concepts Example Related Problems in Tablet 
Instant Absolute Instant January 1, 2021 10:20:30. Type 1 
Relative Instant This time yesterday. Type 2 
Fuzzy Instant About 6:00 this morning. Type 3 
Interval Absolute Interval January 1, 2021, to February 1, 2021. Type 1 
Relative Interval Yesterday from 9:00 to 10:00. Type 2 
Fuzzy Interval The patient may have started exercising at Type 3 
9:00 and finished around 10:00. 
Incomplete Interval The patient stopped taking the medication Type 2 
yesterday. 
Collection Periodic Instant Collection Every morning at 8 am. Type 4 
Irregular Instant Collection The patient had vomited at 10:00 am, Type 5 
11:00 am, and 3:00 pm today. 
Periodic Interval Collection Once a day, 15 minutes each time. Type 4 
Irregular Interval Collection The patient started yesterday at 20:00 and Type 5 


exercised for an hour. Today he started at 
19:00 and only exercised for half an hour. 


Modifier Negation Not during this week. Type 8 
Subjective The time is subjectively described by the Type 6 
patient. 


“Periodic Instant Collection” is a collection of points in time, which is commonly used to describe the 
time for the events in cycle, and occurrence time is also a point in time; “Irregular Instant Collection” is a 
collection of points in time, commonly used to describe the time for events with irregular occurrence time, 
and the occurrence time is a point in time, and its item can also be some “Periodic Instant Collection” 
instances; “Periodic Interval Collection” and “Periodic Instant Collection” are more similar, the difference 
is that the occurrence time of the former is a duration; “Irregular Interval Collection” is similar with 
“Irregular Instant Collection,” but the occurrence time is a duration, and its item can also be some “Periodic 
Interval Collection” instances. 


We divided the time appearing in the clinical field into the 11 main types mentioned above. And there 
is no intersection between them. For negation and subjective modifiers, we treat them as a property of 
temporal instances. 


4. CTO ONTOLOGY 


We developed the Clinical Time Ontology (CTO) for semantic representation and reasoning over the 
temporal aspects discussed in Section 3. In this section, we elaborate on its module design. 
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4.1 Overview 


Temporal concepts cannot be utilized alone, so we need to bind them with the concept of Events. 
Therefore, we defined the class Event to represent clinical events. It has sub-classes such as 
HospitalizationEvent, DischargeEvent, etc. We can extend the list of sub-classes upon requirements. 
According to Table 2, we can obtain 11 classes, such as Absolutelnstant, Relativelnstant, Fuzzylnstant, 
Absolutelnterval, etc. They are both the sub-class of Time. Class Event and class Time can be linked 
by object property hasTime. We refer to the OWL-Time and TEO ontologies when designing our ontology. 
We have some common and different concepts. For example, class Absolute/nstant is the same as Instant 
in OWL-Time, class TimeQuantity is the same as Duration in TEO. However, class /rregularinstanCollection 
does not exist in their ontologies. Figure 1 shows the taxonomy of CTO and the relationship with OWL- 
Time and TEO (using sameAs). 
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Figure 1. Core Taxonomy of Clinical Temporal Ontology. 


4.2 Module Design 


We divided time into four modules, which are /nstant, Interval, InstantCollection, and IntervalCollection. 
Figure 1 illustrates this, and more details can be obtained as followed: 
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4.2.1 Instant 


Class Instant has three subclasses: Absolute/nstant, Relativelnstant, and Fuzzy/nstant, they are respectively 
designed for absolute instant time, relative instance time, and fuzzy instance time. There exist a number of 
properties that describe time at a granular level, such as second, minute, hour. Class TimeUnit is designed 
for representing time’s precision, which means the smallest granularity unit of time. 


Object property prefixAnchor is designed to give a qualitative representation of the relationship between 
the moment and its reference moment. Object property relativeQuantity can quantitatively represent the 
length of difference. For example, “Hospitalized on July 1, 2021, and started showing significant symptoms 
three days ago.” “Three days ago” is a relative time description, we can set it to <relativelnstant1>. 
“July 1, 2021” is an Absolutelnstant instance, and it is the prefixAnchor of <relativelnstant?>. And the 
relativeQuantity of <relativelnstant1> is “3 days.” Figure 2 illustrates this case. 


type; type! 


prefixAnchor 
— ri 
- relativeQuantity = 
comment’ “sy comment 
SS 
Figure 2. Relative Time. 


For fuzzy time, object property /eftAnchor can indicate its earliest possible occurrence time, and 
rightAnchor can indicate the latest one. For example, “Early January” is a fuzzy description, and we can 
set its leftAnchor to <absolute/nstant1>, which for 1, January,” and set its rightAnchor to <absolute/nstant2>, 
which for “10, January.” Figure 3 illustrates this case. 
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Figure 3. Fuzzy Instant. 
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4.2.2 Interval 


Class Interval has four subclasses: Abso/ute/nterval, Relativelnterval, Fuzzylnterval, and Incomplete/nterval. 
The former two both have object property startTime, endTime, and durationQuantity. The differences 
between them are that their startTime’s and endTime’s ranges are not the same class. 


Object property minDurationQuantity and object property maxDurationQuantity are designed for fuzzy 
interval, and they indicate the minimum possible duration and the maximum possible duration respectively. 
For example, “it started on 1, January, and lasted for about half a month.” Figure 4 illustrates this case. 
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Figure 4. Fuzzy Interval. 


Class Incompleteinterval is a special case of Interval, which just has startTime or endTime and does not 
have durationQuantity. 


4.2.3 InstantCollection and IntervalCollection 


The temporal description of medication and check-up events in clinical medical texts is cyclical. For 
example, “From today, review daily at 2:00 pm for five days.” In addition, some irregular time collections 
still exist in the medical texts, such as ‘The patient received rehabilitation training twice a day last year, 
and once every 2 days this year.” Each occurrence may be instant, interval or collection. Therefore, we 
designed class InstantCollection and class InrevalCollection. Class InstantCollection has two subclasses: 
PeriodicInstantCollection and IrregularinstantCollection, and class InrevalCollection has two subclasses: 
PeriodicIntervalCollection and IrregularintervalCollection. 


Object property periodicQuantity can quantitatively represent the cycle, and data property periodicCount 
can represent the number of cycles. Object property eachTime are designed to represent the time of 
each occurrence, which range can be class /nstant or Interval. And we can use periodicQuantity with 
a TimeQuantity instance to represent “daily,” and use durationQuantity with a TimeQuantity instance to 
represent “for five days.” Figure 5 illustrates this case. 
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Figure 5. Periodic Time. 


We can implement the representation of irregular time in two steps as follows: 1) create some Time 
instances; 2) connect them using object property belongTo or hasSubset and data property item/ndex. 
The Time instance can be not only /nstant or Interval, but also InstantCollection or IntervalCollection. 
We can set the second sentence mentioned above to <irregularinstantCollection1>. There are two stages, 
and both of them are cyclic time. They can be represented by two instances of PeriodicinstantCollection: 
<periodicinstantCollection1> and <periodicinstantCollection2>. And then, use object property hasSubset 
to connect them. Figure 6 illustrates this case. 
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Figure 6. Irregular Time. 


4.2.4 Subjective Time 


In clinical texts, it is widespread that patients describe the occurrence time of events by themselves. In 
this case, the time in the text is subjective and maybe not the real-time of the event. To represent subjectivity, 
each Time instance can use the data property subjective with True or False value to indicate whether it is 
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a subjective description of the patient or an objective record so that the physician can make some judgments 
about that time information. Figure 7 illustrates this. 


Figure 7. Subjective Time. 


4.2.5 Temporal Relationship 


Besides aforementioned cases, there are also some cases that indicate the temporal relationship between 
the events, e.g., “receiving treatment B is earlier than taking medicine A.” Allen’s 13 temporal relationships 
can be used to represent partial situations, however, they cannot cover all cases, such as point-to-collection, 
duration-to-collection, and collection-to-collection situations. Therefore, to fill the gap, we extended them 
by additions, such as containAll, containNull, and their inverse properties. Figure 8 illustrates a sample. 
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<> een 
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Figure 8. Temporal Relationship. 


4.2.6 Negation 


The application of chronological relations sometimes appears with his negation. For example, “taking 
medicine A is not earlier than the end of treatment B.” In the medical record data, as long as we cannot 
know or infer that “taking medicine A” before “treatment B,” we can assume that A is not before B. And 
based on the closed world assumption (CWA), we designed the algorithm not, which can be used to 
represent the negation of a temporal relationship, such as not before, not after, and not overlap, etc. 
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5. CTO SEMANTICS 


This section will introduce class concepts and temporal relationship concepts semantically and graphically. 
Formally, Baader and Hanschke [17] proposed an extension to ALC, which named ALC(D). We describe 
the complex time classes and complex temporal relationships in the scheme of this language, and the 
reasoner can implement inductive reasoning according to the descriptions. 


5.1 Complex Time Classes Definition 


We defined some new classes for representing complex time, e.g., PeriodiclnstantCollection, 
PeriodicintervalCollection, etc. Their descriptions are shown below. 
IncompleteInterval = (Astart-time N fend-time) U (Gend-time N fstart-time) 
PeriodicInstantCollection = Astart-time.Instant N Aperiodic-quantity.Quantity 
N deach-time.Instant  (Aend Otime U Aperiodic-count) 
IrregularInstantCollection = first. Instant N Asecond Instant N --- N An Instant 
Astart-time.Instant N Aperiodic-quantity.Quantity N 
deach-time.Interval N (Jend-time U Aperiodic-count) 
PeriodicIntervalCollection = Astart-time.Instant N Aperiodic-quantity.Quantity N 
Jeach-time.Interval N (Jend-time U Jperiodic-count) 


IrregularIntervalCollection = Jfirst.Interval N Asecond Interval N --- N 3n, Interval 


PeriodicCollection = PeriodicInstantCollection U PeriodicIntervalCollection 
IrregularCollection = IrregularInstantCollection U IrregularIntervalCollection 
InstantCollection = PeriodicInsntantCollection U IrregularInstantCollection 
IntervalCollection = PeriodicIntervalCollection U IrregularIntervalCollection 
Collection = InstanCollection U IntervalCollection 


Pair = (Sformer.Instant N Jlatter.Instant) U 


(Aformer Instant N Jlatter.Interval) 


(Əformer.IrregularIntervalCollection N Jlatter.IrregularIntervalCollection) 


The description of concept PeriodicInstantCollection means its instance has start-time (Instant instance), 
periodic-quantity (Quantity instance), and each-time (Instant instance). And it is worth noting that it may 
have end-time (Instant instance) or periodic-count (Integer instance). Quantity denotes the quantification 
distance between one time and another. Pair means the two objects are used to calculate the relationship 
(One-by-one combinations between all time-related types). 
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5.2 Complex Temporal Relation Definition 


We extended Allen’s 13 temporal relationships to cover more comparisons between temporal types, such 
as startPoint, containAll, etc.We can determine whether the time of two events is completely covered or 
intersected or unrelated based on whether there is a relationship a//During, someDuring or nullDuring 
between the time of the two events. before, after, meet is a special case of nu//During, but there are more 
than just these three cases of nu//During; there is also a case that is similar to overlap, but it may be that 
each element of the two collections alternates without intersecting each other. These three temporal 
relationships are mainly for Collection’s subclasses. startPoint and finishPoint are mainly for Instant’s 
subclasses, however, start and finish are mainly for Interval’s and IntervalCollection’s subclasses. 


All temporal relationships are shown in Figure 9. We use different colors to distinguish between a, b 
objects, simple and complex time classes, as well as Allen’s, TEO’s and the CTO’s temporal relationships. 
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a endAfterEnd b 4 
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a Ca CG 
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Figure 9. Temporal Relation Semantics in Allen’s, TEO and CTO. 
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Table 3. Description of temporal relation of CTO using ALC(D). 


F! 


Id Concepts 


Description 


before 
2 during 


3 allDuring 


4 someDuring 


5 nullDuring 


6 startPoint 


7 finish Point 


Pair N (EndTime( former) < StartTime(latter )) 
(StartTime (latter Interval ) < StartTime( former Interval )) 
i (EndTime( former.Interval) < EndTime(latter.Interval )) | 
— (ei (/atter.Interval ) < former.Instant ) | 


N ( former Instant < EndTime (latter.Interval )) 


VAllitems ( former.Collection) during 
VAllItems (latter Collection) 


Pair N 
JAllItems (latter. Ci ollection) 


P JAllItems ( former.Collection) during 
Pair N l 


PAllitems (latter.Collection) 


CalTime( former.Instant, y 
StartTime (latter Interval ) 


Pair N 


V/Allitems ( former.Collection) during ) 
ia alTime ( formerJInstant,@) = F 


fa Time (latter.IntervalCollection | 


EndTime (latter.Interval ) 
Pair N 


f pa former.Instant, ð) = | 


CalTime( former.Instant, Ø) ) 


= EndTime (latter.IntervalCollection) 


In Table 3, we used 3 auxiliary functions: Allitems(), CalTime() and StartTime(). The semantics of these 
functions are provided in Appendix A. 


5.3 Negation Temporal Relation Definition 


If the reasoning machine can infer that there exists a temporal relation r1 between time A and time B, 
but cannot conclude that there exists temporal relation r2, then according to CWA, there is a negation 


relation not r2 between the two times. For example, if the reasoner cannot reason out <time1> before 


<time2>, <time1> not before <time2> can be concluded. All of the temporal relationships can get their 


negations. Table 4 introduces some examples. 
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Table 4. Description of negated temporal relation of CTO using ALC(D). 


Id Concepts Description 

8 not before Pair N (former before latter) 
not equal Pair N (Bformer equal latter) 

10 not meet Pair N (former meet latter) 


6. EMPIRICAL STUDY 


In order to verify the ability of the CTO ontology to express time information in the clinical field, we 
selected a set of case texts of people with a mental health condition. We used an anonymized realistic 
dataset from a hospital in China. We randomly selected 300 patients, which contained approximately 3000 
Chinese statements. And then, we arranged annotators to annotate the time information and conducted a 
quantitative analysis using the Inter-Annotator Agreement (IAA) [18] metric. IAA metric is commonly used 
to evaluate annotation consistency and the difficulty of annotation, usually focusing on the following 
metrics: Precision, Recall, and F1. At the same time, we analyzed the characteristics of other ontologies. 
And we also compared and analyzed the expressiveness of CTO with other ontologies in terms of complex 
time and temporal relations. 


6.1 Evaluation Results 


For IAA, three annotators were asked to annotate time information in 300 patient cases jointly. The 
evaluation consists of two parts, (1) evaluation of classes and object properties (without temporal relations), 
and (2) evaluation of temporal relationships. For evaluation of classes and object properties, they first 
annotated classes, and the annotation methods were discussed together. After that, they discussed and 
agreed on the Gold Standard. The annotation of object properties was same, and the base data is the 
Gold Standard. 


The mean value of F7 metric of time-related classes annotation were 77.18% and 83.06% (exact mapping 
and partial mapping). And the average F7 metric of object properties (without temporal relationship object 
properties) was 87.34%. Considering that complex time information can be represented in various ways, 
such as the representation of time interval can use startTime/endTime + durationQuantity or startTime + 
endTime. If such errors are ignored, the average score is 93.61%. 


For evaluation of temporal relations, we asked three annotators to annotate the temporal relations 
between time entities, using the Gold Standard obtained by previous sub-process. We had developed a 
reasoning machine for the experiment about evaluating of temporal relations, and regarded the results as 
Gold Standard for the second evaluation task. Note that we treat pairs of tokens that have the inverse 
property as true positive as well, such as <time1> before <time2> and <time2> after <time1>. The mean 
value of Precision, Recall and F1 are 96.98%, 91.97%, 94.39% respectively. We also calculate the three 
metrics of each temporal relationship to analyze which temporal relations are more difficult to express. The 
results are shown in Figure 10. 
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Figure 10. IAA results for CTO temporal relations. 


From Figure 10, it can be seen that it is easy to reach a consensus on the representation of simple 
relations, such as after, overlap, endBeforeEnd, etc. However, there are difficulties in representing more 
complex temporal relations, such as containAll, finishPoint, etc. After analysis, it is found that manual 
annotation will show omissions for this type of data but still maintain a high F7 score, which indicates that 
the CTO ontology can also achieve better results in the representation of temporal relations. 


Concerning the ability of ontologies to cover temporal instances, we found that CTO ontologies 
are able to cover almost all types of temporal descriptions. As can be seen from the Figure 11 and 
Figure 12, instances of complex time types (PeriodicinstantCollection, PeriodicIntervalCollection, 
IrregularInstantCollection, IrregularIntervalCollection) account for approximately 16.77% (594 of 3542) of 
all time types, and the extended temporal relationships (containAll, containSome, containNull, startPoint, 
finishPoint) account for 21.19% of all temporal relationships (629 of 2969). We provided 2 sample texts 
from clinical records as well as their relevant knowledge graphs annotated with CTO in Appendix B. 
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= Absolutelnstant 


2% 1% s Relativelnstant 
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Figure 11. Distribution of Time Class. 


= Fuzzylnstant 
* Absoluteinterval 


s Relativelnterval 


s Fuzzyinterval 

= Incompleteinterval 
PeriodicinstantCollection 

s PeriodicintervalCollection 

« IrregularinstantCollection 


IrregularintervalCollection 


= before 

s equal 

a finish 

= start 

= contain 

= containAll 

= containSome 
= containNull 

= endBeforeEnd 
a endEqualEnd 
s startBeforeStart 
= startEqualStart 
= finishPoint 


Figure 12. Distribution of temporal relation object properties. 


6.2 Expressiveness Analysis 


In this section, OWL-Time, PSI-Time, TEO ontology are compared with the CTO ontology on their ability 
to represent the eight problems presented in Section 3. Table 5 shows the result, and we use mark V for 
those ontologies capable of getting problem’s results by querying. 
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Table 5. Comparison of different temporal ontologies about complex time expressiveness. 


Problem 
ioy 1 2 3 4 5 6 7 8 
OWL-Time y y 
PSI-Time V V V V 
TEO y y y y 
CTO V V V V V V V V 


For Problem-1 (Absolute Time Problems), all of them can be queried to obtain the exact time, even the 
hour, minute and second. 


For Problem-2 (Lack Information Time Problems (Relative Time and Incomplete Time)), OWL-Time mainly 
focused on absolute instant and interval, and it cannot represent relative quantities and can certainly not 
represent incomplete time, as TEO said. If we want to obtain the time referred to by “three days before 
hospitalization,” then we need to implement a representation of relative quantities as well as reference 
objects, but they do not have this capability and therefore cannot answer questions relating to such times. 


For Problem-3 (Fuzzy Time Problems), when we want to compare the temporal relationships between 
events that occurred more ambiguous times, it is inappropriate to compare their absolute times. For example, 
the time relationship between time A “at about 8:10 a.m. on November 1, 2021” and time B “at 8 a.m. 
on November 1, 2021, which lasted about 9 minutes” could be either during, after, or finishPoint. It, of 
course, is related to the range of fuzzy values. By introducing the maximum and minimum values as left 
and right anchors, we can compare them in terms of time relationships and obtain a range of values for 
the end time. OWL-Time and PSI-Time do not consider the concept of fuzzy time and therefore cannot be 
queried for answers; TEO does take into account fuzzy time, but it only marks time by True and False and 
does not analyze it quantitatively. 


For Problem-4 (Cyclic Time Problems), we can query PSI-Time, TEO and CTO ontologies to obtain the 
start time, each occurrence and the frequency of the periodic time to answer questions, such as “When is 
the next check?” “How many times has it been done so far?” etc. 


For Problem-5 (Irregular Time Problems), one of the characteristic points of the CTO ontology is that an 
item of the /rregularCollection can still be a collection. Thus, it is possible to implement a stage-by-stage 
representation of time, e.g., “once a day for the first session, once a week for the second session,” etc. Not 
only can we obtain internal time-related calculations such as “How long between the first and second 
sessions?” and “Which session is now?”, but we can also compare the time relationship with other times. 


For Problem-6 (Subjective Problems), we use True or False flag, which is the value of the subjective 
property, to provide a basis for the user to refer to when reasoning or querying results. 


For Problem-7 (Temporal Relation Problems), all of them can get the answer. Both TEO and CTO 
ontologies extend the temporal relations and CTO ontology defines different temporal relations between 
multiple types of time. 
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For Problem-8 (Negation Problems), CTO ontology regards case data as a closed world, and if there does 
not exist the relation r between the time a and b, then (a, not r, b) is satisfied. Thus, the CTO ontology can 
answer that question based on the CWA. 


Overall, CTO ontology is superior to other time ontologies in terms of answering time-related clinical 
questions. 


7. CONCLUSION AND FUTURE WORK 


In this paper, we identified eight time-related problems in clinical domain, and proposed the CTO 
ontology with a taxonomy of complicated temporal concepts to solve these problems, which covers the 
fuzziness, relativity, irregular and cyclic aspects of time expressions in clinical text. In addition, we also 
marked whether the time is subjective so that we can deal with it in the future. Simultaneously, we defined 
the concept description of the complex time-related classes, and extended Allen’s and TEO’s temporal 
relationship to support temporal relations for complex time. More importantly, we give definitions of the 
various temporal relationships between all time-related types (/nstant, Interval, Collection), both in formulaic 
and picture form. The experimental results show that complex time classes account for 16.77% of all classes 
in clinical medical texts and the extended temporal relationships account for 21.19% of all temporal 
relations, and CTO ontology can represent more than 93% of temporal information and temporal 
relationships. In summary, we proposed a robust temporal ontology capable of expressing a variety of 
complex temporal information in clinical texts. 


The CTO ontology can be improved or extended in the following aspects: 1) The reasoning about fuzzy 
time is currently using the way of setting left and right anchor, and the problem of- selecting left and right 
anchor points is also tricky. In the future, we hope to use probabilistic methods to achieve probability-level 
representation and reasoning capabilities; 2) Building a patient history knowledge graph using CTO ontology 
automatically is also on the agenda. By this way, we can further validate and improve its use in realistic 
scenarios. 
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APPENDIX A. SEMANTICS FOR THE AUXILIARY FUNCTIONS IN TEMPORAL RELATION 
DEFINITION 


{a first,a second,---,an ,a..drregularCollection 


th 
Allltems(a.Time) = 4 {0 < i < n| a start-time+ixa periodic-quantity ® a each-time} ,a.PeriodicCollection 
{a} ,a.Instant U a.Interval 


N CalTime(a prefix-anchor,a relativ-quantity) ,„a.Relativelnstant 
CalTime(a.Instant, q.Quantity) = 


at+q ,a.Absolutelnstant 
CalTime(a,@) ,a.lnstant 
CalTime(a start-time) ,a.Jlnterval 


StartTime(a.Time) = 
StartTime(CalTime(a start-time ® a each-time,®))_ „a.PeriodicCollection 


StartTime(CalTime(a first, ©)) „a.IrregularCollection 


Allltems(a.Collection) function can get all the elements in Collection_instance, which is convenient for 
calculation, especially with cyclic collections, which need to be calculated to get the real time of each 
occurrence. Ca/lTime(a./nstant, q.Quantity) function can get the calculated real time point of relative time 
or result of temporal addition. StartTime(a.Time) function provides an interface for obtaining the start time 
of all time types, which is similar to EndTime(a. Time) function. 


APPENDIX B. USAGE EXAMPLES FOR TEMPORAL REPRESENTATION AND QUERY 


Figure B.13-B.16 illustrate the details, in which the words in red italic are manually annotated as events 
and times. We use lines with arrows to establish a correspondence between statements containing time 
information and the class to which that time information belongs. 


Patient Id: 11056 Absolutelnstant Relativelnterval AbsoluteInterval 
Age: 53 t 
Hospitalization Time:[20100602] 


Discharge Time:(20100610 
Main Complaint: Persistent scizures|in the last | week] aggravated with bchavioral impulsivity, and severe \llncss. 
Present Medical History: The patient developed insomnia, talkativeness, emotional instability in{July 1976 without any 


apparent cause, which lasted for one monthjand resolved on its own without any treatment. [Between early 2004 and _| 
early 2006 the patient was taking Vistone 4 mg/d consistently. [On January 3, , he was admitted to another hospita 
by his family and started taking medication, then stopped taking medication on his own for an unknown period of time. 


The patient‘s family seeks further treatment for the patient, today, he was admitted to our hospital for the second time 


with "bipolar disorder". 


| IncompleteInterval | 


Figure B.13. Sample text which Includes Absolute Instant Time, Absolute Interval Time, Relative Time, Fuzzy Time, 
and Incomplete Time. 
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PeriodicInstantCollection 
TrregularIntervalCollection 


The patient has been aggravated jin the past week, with daily] flushing, irritabilivy, fussiness, sleepless nights, and 
shouting [affer 5 pm.](Record Time i is.20100607)j The patient had been quarreling, not eating, and breaking things on 
:00 p.m. and until around 5:00 p.m] The patient's family said that 


after being hospitalized on[Scptember T, 2007] he hay an opcration and was discharged in [October T, 2007. _] 


AbsoluteInstant 


Figure B.14. Sample context which includes Cyclic Time, Irregular Time, Subjective Time, and apparent Temporal 
Relation. 


Pi se haere ade ——- another 
family to receive medication and then stopped m 
on his own for an unknown period 


Z dischargeEs “a Ses = 


Figure B.15. Representation for the text shown in FigureB.13 using CTO. 
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"on September 3, around 2:00 
pm until around 5:00 pm” 


been aggravated in = é R 
> "on September 1, around 2:00 
hep penampen pm until around 5:00 pm" 


baste 

i } C. 

i < absoluteinstant2 / = 
a” f 


comment day TEN - 
rene Ges eu E avs ies amma » discharged in 
"Record Time:20100607" 
legend: [_value | | p< absolute instar nt 
Í EER 
Cm > Cake D> 


Figure B.16. Representation for the text shown in FigureB.14 using CTO. 


Figure B.13 is an example case data, which contains some events (e.g.,hospitalization, discharge, 
medication, and paroxysm events) and some types of time (e.g.,absolute instant time, absolute interval time, 
relative time, fuzzy time, and incomplete time). Figure B.15 illustrates the representation. 


Figure B.13 and Figure B.15 present the time representation for problems 1-3. For problems 4-7, we find 
some short sentences from the dataset. Figure B.14 shows the corpora, and Figure B.16 illustrates the 
representation. 
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